data processing workflow (Mestrelab Research)
Structured Review

Data Processing Workflow, supplied by Mestrelab Research, used in various techniques. Bioz Stars score: 86/100, based on 1 PubMed citations. ZERO BIAS - scores, article reviews, protocol conditions and more
https://www.bioz.com/result/data processing workflow/product/Mestrelab Research
Average 86 stars, based on 1 article reviews
Images
1) Product Images from "Accelerating natural product discovery with linked MS-genomics and language/transformer-based models"
Article Title: Accelerating natural product discovery with linked MS-genomics and language/transformer-based models
Journal: npj Antimicrobials and Resistance
doi: 10.1038/s44259-026-00206-7
Figure Legend Snippet: Putative producers of compounds of interest are prioritized through cross-validation of the top ranked strains via Protein Language Model (PLM) embeddings and via our in-house developed Workflow for Intelligent Structural Elucidation (WISE).
Techniques Used: Biomarker Discovery
Figure Legend Snippet: A Overview of the cheminformatics analysis workflow (WISE) to annotate individual molecules from raw liquid chromatography-tandem mass spectral (LC-MS/MS) data of fermentation extracts. B Distribution of WISE scores for correct structures (Real) versus isobaric decoy structures (within 0.005 m/z) with different molecular formulae (Decoy) from the Critical Assessment for Small Molecule Identification (CASMI2022) dataset. Low and high WISE score thresholds of 0.35 and 0.65 respectively are annotated for reference. C Receiver Operating Characteristic (ROC) curve demonstrating the discrimination of correct structures from decoys with differing molecular formulae within a 0.005 m/z window. AUC = Area Under Curve. CI = Confidence Interval. Detailed data in Table .
Techniques Used: Liquid Chromatography, Liquid Chromatography with Mass Spectroscopy
Figure Legend Snippet: A Elucidated biosynthetic gene cluster BGC0000709, responsible for neomycin production. B Similarity (%) of each embedded protein and sequence similarity (%) of the top-scoring protein matches. C Predicted protein structures of the top matches for Fe-S oxidoreductase ( CAF33317 ) from neomycin BGC. D Heatmap of WISE scores for strain-media combinations derived from the liquid chromatography-tandem mass spectrometry (LC-MS/MS) data of fermentation extracts. Only strains with at least 1 media combination with non-zero WISE score are shown. Activated strains (if any) are referred to by their parent strain instead of individually (details of the activated strains are in Table ). E Structure of neomycin B. F Workflow of the co-occurrence strategy and matrix ranking table with 0.35 WISE score and 0.75 PLM score cutoff. Strains that have been cross validated are highlighted in green or yellow, indicating whether these compounds were present (green) or absent (yellow). Lines indicating higher confidence thresholds of 0.65 WISE score (in red) and 2.5 PLM score (in blue) are also included.
Techniques Used: Sequencing, Derivative Assay, Liquid Chromatography, Mass Spectrometry, Liquid Chromatography with Mass Spectroscopy